Representational Interoperability of Linguistic and Collaborative Knowledge Bases

نویسندگان

  • Konstantina Garoufi
  • Iryna Gurevych
چکیده

Creating a Natural Language Processing (NLP) application often requires to access lexical-semantic Knowledge Bases (KBs). Recently, Collaborative Knowledge Bases (CKBs) such as Wikipedia and Wiktionary1 have been recognized as promising lexicalsemantic KBs for NLP (Zesch et al., 2008b), complementing traditional Linguistic Knowledge Bases (LKBs). As CKBs differ significantly from LKBs concerning their content, structure and topological properties, the interoperability between CKBs and LKBs has become a major issue. To address this problem, we have developed a model of representational interoperability between LKBs and CKBs, which abstracts over the differences in their structures, and enables a uniform representation of their content in terms of entities and lexical-semantic relations between them. An entity consists of a set of lexeme–sense pairs along with a part-of-speech (PoS). The currently supported relations are the lexical relations synonymy and antonymy, as well as the semantic relations hypernymy, hyponymy, holonymy, meronymy and other, which covers any lexical-semantic relation other than the previously listed. NLP algorithms can thus be implemented in an one-time effort, as they only have to “know” about generalized entities and relations instead of being adapted to each KB individually. The KBs currently integrated are the LKBs WordNet (Fellbaum, 1998), GermaNet (Kunze, 2004), Cyc (Lenat and Guha, 1989), Roget’s Thesaurus (Jarmasz and Szpakowicz, 2003), Leipzig Annotation Project (Biemann, 2005), and the CKBs Wikipedia and Wiktionary, which are available for a large number of lan-

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Role of Common Ontology in Achieving Sharable, Reusable Knowledge Bases

Although AI research and commercial system development depend on bodies of formally represented knowledge that are expensive and diicult to construct, current knowledge base design does not support the accumulation or reuse of such knowledge. This paper presents a strategy for building libraries of sharable, reusable knowledge in which common ontologies play a central role as a knowledge coupli...

متن کامل

Graph-Theoretic Analysis of Collaborative Knowledge Bases in Natural Language Processing

We present a graph-theoretic analysis of the topological structures underlying the collaborative knowledge bases Wikipedia and Wiktionary, which are promising uprising resources in Natural Language Processing. We contrastively compare them to a conventional linguistic knowledge base, and address the issue of how these Social Web knowledge repositories can be best exploited within the Social-Sem...

متن کامل

Collaborative Knowledge Acquisition under Control of a Non-Regression Test System

This paper introduces BeGoood, a generic system for managing non-regression tests on knowledge-bases. BeGoood is a system allowing to define test plans in order to monitor the evolution of knowledgebases. Any system answering queries by providing results in the form of set of strings can be tested with BeGoood. BeGoood has been developed following a REST architecture and is independent of any a...

متن کامل

Interoperability of Corpora and Annotations

This paper describes the application of OWL and RDF to address the interoperability of linguistic corpora and linguistic annotations within such corpora. Interoperability of linguistic corpora involves two aspects: Structural interoperability (annotations of different origin are represented using the same formalism) and conceptual interoperability (annotations of different origin are linked to ...

متن کامل

Building Parameterized Canonical Representations to Achieve Interoperability among Heterogeneous Databases

This paper describes a technique to support interoperable query processing when multiple heterogeneous databases are accessed. We focus on the problem of supporting query transformation transparently , so a user can pose queries locally, without any need of global knowledge about diierent data models and schemas. To support interoperable query transformation, we need to resolve the connicts (i....

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008